Dynamic Transcription for Low-Latency Speech Translation

Authors

Jan Niehues

Thai Son Nguyen

Eunah Cho

Thanh-Le Ha

Kevin Kilgour

Markus Müller

Matthias Sperber

Sebastian Stüker

Alexander H. Waibel

Abstract

Latency is one of the main challenges in the task of simultaneous spoken language translation. While significant improvements in recent years have led to high-quality automatic translations, their usefulness in real-time settings is still severely limited by the large delay between the input speech and the delivered translation. In this paper, we present a novel scheme which drastically reduces the latency of a large-scale speech translation system. Within this scheme, the transcribed text and its translation can be updated when more context is available, even after they have been presented to the user. This allows us to display an initial transcript and its translation to the user with very low latency. If necessary, both transcript and translation can later be updated to better, more accurate versions until eventually the final versions are displayed. Using this framework, we are able to reduce the latency of the source language transcript by half. For the translation, an average delay of 3.3s was achieved, which is more than twice as fast as our initial system.
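The update idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the `Segment` and `RevisableDisplay` names, the segment identifiers, and the `final` flag are all assumptions made for illustration. It only shows how text already on screen might be overwritten by a later, better hypothesis until a final version arrives.

```python
from dataclasses import dataclass, field


@dataclass
class Segment:
    """One displayed unit: a transcript and its translation (hypothetical structure)."""
    transcript: str
    translation: str
    final: bool = False


@dataclass
class RevisableDisplay:
    """Holds the text shown to the user and lets later messages overwrite it."""
    segments: dict = field(default_factory=dict)

    def update(self, seg_id, transcript, translation, final=False):
        """Show or revise a segment; return False if it was already final."""
        current = self.segments.get(seg_id)
        if current is not None and current.final:
            return False  # assumption: a final version is never revised again
        self.segments[seg_id] = Segment(transcript, translation, final)
        return True

    def render(self):
        """Return the currently visible (transcript, translation) pair."""
        ordered = [self.segments[k] for k in sorted(self.segments)]
        return (" ".join(s.transcript for s in ordered),
                " ".join(s.translation for s in ordered))


if __name__ == "__main__":
    display = RevisableDisplay()
    # Early, low-latency hypothesis (made-up strings, not data from the paper).
    display.update(0, "hello word", "hallo Wort")
    print(display.render())
    # A later update with more context replaces what was shown.
    display.update(0, "hello world", "hallo Welt", final=True)
    print(display.render())
```

Keying updates by segment identifier is one possible design choice; it keeps a revision local to the segment it concerns, so the rest of the displayed text stays untouched.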

Similar resources

A New Evaluation Method for Speech Translation Systems and a Case Study on ATR-MATRIX from Japanese to English

ATR-MATRIX is a multi-lingual speech-to-speech translation system designed to facilitate communications between two parties of different languages engaged in a spontaneous conversation in a travel arrangement domain. In this paper, we propose a new evaluation method for speech translation systems. Our current focus is on measuring the robustness of a language translation sub-system, with quick c...

Lecture Translator - Speech translation framework for simultaneous lecture translation

Foreign students at German universities often have difficulties following lectures as they are often held in German. Since human interpreters are too expensive for universities we are addressing this problem via speech translation technology deployed in KIT’s lecture halls. Our simultaneous lecture translation system automatically translates lectures from German to English in real-time. Other s...

High-performance low-latency speech recognition via multi-layered feature streaming and fast Gaussian computation

Highly accurate speech recognition with very low latency is a big challenge but also an important requirement for modern real-time speech recognition applications such as speech-to-speech translation. We attack this problem by proposing a highly effective and efficient streaming mode decoding scheme. A novel multi-layered feature streaming method is introduced to minimize truncation errors duri...

Role of pausing in text-to-speech synthesis for simultaneous interpretation

The goal of simultaneous speech-to-speech (S2S) translation is to translate source language speech into target language with low latency. While conventional S2S translation systems typically ignore the source language acoustic-prosodic information such as pausing, exploiting such information for simultaneous S2S translation can potentially aid in the chunking of source text in...

Low-latency incremental speech transcription in the synface project

In this paper, a real-time decoder for low-latency online speech transcription is presented. The system was developed within the Synface project, which aims to improve the possibilities for hard of hearing people to use conventional telephony by providing speech-synchronized multimodal feedback. This paper addresses the specific issues related to HMM-based incremental phone classification with ...

Publication date: 2016
